RAW SEQUENCE LISTING
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC: 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, 703-308-4212. 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name,
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202

Or

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202
BEST AVAILABLE COPY



Raw Sequence Listing Error Summary

ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1  Wrapped Nucleics / Wrapped Aminos
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
   was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
   prevent "wrapping."

2  Invalid Line Length
   The rules require that a line not exceed 72 characters in length. This includes white spaces.

3  Misaligned Amino Numbering
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
   use space characters, instead.

4  Non-ASCII
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
   ensure your subsequent submission is saved in ASCII text.

5  Variable Length
   Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
   each n or Xaa can only represent a single residue. Please present the maximum number of each
   residue having variable length and indicate in the <220>-<223> section that some may be missing.

6  PatentIn 2.0 "bug"
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
   sequence(s) ____. Normally, PatentIn would automatically generate this section from the
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
   Artificial or Unknown sequences.

7  Skipped Sequences (OLD RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:
   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   This sequence is intentionally skipped

   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8  Skipped Sequences (NEW RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence.
   <210> sequence id number
   <400> sequence id number
   000

9  Use of n's or Xaa's (NEW RULES)
   Use of n's and/or Xaa's have been detected in the Sequence Listing.
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
   is Artificial Sequence.

11 Use of <220>
   Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
   "Unknown." Please explain source of genetic material in <220> to <223> section.
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13 Misuse of n
   n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
   any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001
RAW SEQUENCE LISTING                     DATE: 07/18/2003
PATENT APPLICATION: US/09/554,533        TIME: 13:08:11

Input Set : A:\EP.txt
Output Set: N:\CRF4\07182003\l554533.raw

      4 <110> APPLICANT: BEELEY, NIGEL ROBERT ARNOLD
      5       PRICKETT, KATHRYN S.
      7 <120> TITLE OF INVENTION: NOVEL EXENDIN AGONIST COMPOUNDS
W-->  8 <130> FILE REFERENCE: 238/086 US (030639.0043.UTL2)
     10 <140> CURRENT APPLICATION NUMBER: 09/554,533
     11 <141> CURRENT FILING DATE: 1998-11-13
     13 <150> PRIOR APPLICATION NUMBER: PCT/US98/24210
     14 <151> PRIOR FILING DATE: 1998-11-13
     16 <150> PRIOR APPLICATION NUMBER: US 60/065,442
     17 <151> PRIOR FILING DATE: 1997-11-14
     19 <160> NUMBER OF SEQ ID NOS: 74
     21 <170> SOFTWARE: FastSEQ for Windows Version 3.0



ERRORED SEQUENCES

     68 <210> SEQ ID NO: 3
     69 <211> LENGTH: 30
     70 <212> TYPE: PRT
     71 <213> ORGANISM: Homo sapien
     72 <220> FEATURE:
     73 <221> NAME/KEY: AMIDATION
     74 <222> LOCATION: (30)...(30)
     75 <223> OTHER INFORMATION: amidated Arg (Argininamide)

     77 <400> SEQUENCE: 3
     79 His Ala Glu Gly Thr Phe Thr Ser Asp Val Ser Ser Tyr Leu Glu Gly
     82 Gln Ala Ala Lys Glu Phe Ile Ala Trp Leu Val Lys Gly Arg 20
E--> 83 25 30

     85 <210> SEQ ID NO: 4
     86 <211> LENGTH: 29
     87 <212> TYPE: PRT
     88 <213> ORGANISM: Artificial Sequence

     90 <220> FEATURE:
     91 <223> OTHER INFORMATION: artificially synthesized sequence of novel exendin agonist
     92       compound

     94 <220> FEATURE:
     95 <221> NAME/KEY: VARIANT
     96 <222> LOCATION: (1)...(7)
     97 <223> OTHER INFORMATION: Xaa in position 1 is His, Arg or Tyr; Xaa in position 2 is
     98       Ser, Gly, Ala or Thr; Xaa in position 3 is Asp or Glu;
     99       Xaa in position 5 is Ala or Thr; Xaa in position 6 is Ala,
    100       Phe, Tyr or naphthylalanine; Xaa in position 7 is Thr or Ser;

    102 <220> FEATURE:
    103 <221> NAME/KEY: VARIANT
    104 <222> LOCATION: (8)...(13)
    105 <223> OTHER INFORMATION: Xaa in position 8 is Ala, Ser or Thr; Xaa in position 9 is
    106       Asp or Glu; Xaa in position 10 is Ala, Leu, Ile, Val, pentyl-
    107       glycine or Met; Xaa in position 11 is Ala or Ser; Xaa in
    108       position 12 is Ala or Lys; Xaa in position 13 is Ala or Gln;

    110 <220> FEATURE:
    111 <221> NAME/KEY: VARIANT
    112 <222> LOCATION: (14)...(20)
    113 <223> OTHER INFORMATION: Xaa in position 14 is Ala, Leu, Ile, pentylglycine, Val or
    114       Met; Xaa in position 15 is Ala or Glu; Xaa in position 16 is
    115       Ala or Glu; Xaa in position 17 is Ala or Glu; Xaa in position
    116       19 is Ala or Val; Xaa in position 20 is Ala or Arg;

    118 <220> FEATURE:
    119 <221> NAME/KEY: VARIANT
    120 <222> LOCATION: (21)...(24)
    121 <223> OTHER INFORMATION: Xaa in position 21 is Ala or Leu; Xaa in position 22 is Phe,
    122       Tyr or naphthylalanine; Xaa in position 23 is Ile, Val, Leu,
    123       pentylglycine, tert-butylglycine or Met; Xaa in position 24
    124       is Ala, Glu or Asp;

    126 <220> FEATURE:
    127 <221> NAME/KEY: VARIANT
    128 <222> LOCATION: (25)...(28)
    129 <223> OTHER INFORMATION: Xaa in position 25 is Ala, Trp, Phe, Tyr or naphthylalanine;
    130       Xaa in position 26 is Ala or Leu; Xaa in position 27 is Ala
    131       or Lys; Xaa in position 28 is Ala or Asn;

    133 <220> FEATURE:
    134 <221> NAME/KEY: VARIANT
    135 <222> LOCATION: (29)...(29)
    136 <223> OTHER INFORMATION: Xaa in position 29 is -OH; -NH2; Gly-Z2; Gly Gly-Z2; Gly Gly
    137       Xaa31-Z2; Gly Gly Xaa31 Ser-Z2; Gly Gly Xaa31 Ser Ser-Z2; Gly Gly
    138       Xaa31 Ser Ser Gly-Z2; Gly Gly Xaa31 Ser Ser Gly Ala-Z2; Gly Gly
    139       Xaa31 Ser Ser Gly Ala Xaa36-Z2;

    141 <220> FEATURE:
    142 <221> NAME/KEY: VARIANT
    143 <222> LOCATION: (29)...(29)
    144 <223> OTHER INFORMATION:
    145       Ser Ser Gly Ala Xaa36 Xaa37 Xaa38-Z2;

    147 <220> FEATURE:
    148 <221> NAME/KEY: VARIANT
    149 <222> LOCATION: (29)...(29)
    150 <223> OTHER INFORMATION: where Xaa31, Xaa36, Xaa37 and Xaa38 are independently Pro,
    151       homoproline, 3Hyp, 4Hyp, thioproline, N-alkylglycine,
    152       N-alkylpentylglycine or N-alkylalanine; and Z2 is

    154 <220> FEATURE:
    155 <221> NAME/KEY: VARIANT
    156 <222> LOCATION: (3)...(28)
    157 <223> OTHER INFORMATION: provided that no more than three of Xaa in positions 3, 5, 6,
    158       8, 10, 11, 12, 13, 14, 15, 16, 17, 19, 20, 21, 24, 25, 26,
    159       27 and 28 are Ala.

    161 <400> SEQUENCE: 4
    162 Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 1
E--> 163 5 10 15
W--> 164 Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
E--> 165 20

    168 <210> SEQ ID NO: 5
    169 <211> LENGTH: 30
    170 <212> TYPE: PRT
    171 <213> ORGANISM: Artificial Sequence

    173 <220> FEATURE:
    174 <223> OTHER INFORMATION: artificially synthesized sequence of novel exendin agonist
    175       compound

    177 <220> FEATURE:
    178 <221> NAME/KEY: AMIDATION
    179 <222> LOCATION: (30)...(30)
    180 <223> OTHER INFORMATION: amidated Gly (Glycinamide)

    182 <400> SEQUENCE: 5
    183 His Gly Glu Gly Thr Phe Thr Ser Asp Leu Ser Lys Gln Met Glu Glu 1
E--> 184 5 10 15
    186 Glu Ala Val Arg Leu Phe Ile Glu Trp Leu Lys Asn Gly Gly 20
E--> 187 25 30
RAW SEQUENCE LISTING ERROR SUMMARY       DATE: 07/18/2003
PATENT APPLICATION: US/09/554,533        TIME: 13:08:13

Input Set : A:\EP.txt
Output Set: N:\CRF4\07182003\I554533.raw



Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:3; Line(s) 82 
Seq#:4; Line(s) 162 
Seq#:5; Line(s) 183,186 
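
Applicants may wish to screen a listing for over-long lines before resubmitting. The following is a minimal sketch of such a check, not the Checker program itself; the function name find_long_lines and the sample text are hypothetical. It assumes the 72-character limit quoted above and counts every character on a line, including spaces.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; spaces count


def find_long_lines(text, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`."""
    offenders = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > limit:
            offenders.append((number, len(line)))
    return offenders


if __name__ == "__main__":
    # Hypothetical two-line sample; the second line is 84 characters long.
    sample = "<210> SEQ ID NO: 5\n" + "His Gly Glu " * 7
    for number, length in find_long_lines(sample):
        print(f"line {number}: {length} characters")
```

Line numbers reported by this sketch are plain 1-based positions in the text passed in; they may not match the line numbers printed in the raw sequence listing.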
VARIABLE LOCATION SUMMARY                DATE: 07/18/2003
PATENT APPLICATION: US/09/554,533        TIME: 13:08:13

Input Set : A:\EP.txt
Output Set: N:\CRF4\07182003\I554533.raw



Use of n's or Xaa's (NEW RULES) : 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which

residue n or Xaa represents. 
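
The requirement above can be approximated by comparing the positions of Xaa in a sequence against the <222> LOCATION ranges supplied for it. The sketch below is illustrative only and is not how STIC performs the check; the function names are hypothetical, it assumes three-letter residue codes separated by whitespace, and it tests only whether each Xaa falls inside some <222> range, not whether the <223> text names the residue.

```python
import re


def xaa_positions(sequence_text):
    """Return 1-based positions of Xaa in a three-letter amino acid string.

    Purely numeric tokens are ignored so that numbering lines can be
    passed in together with residue lines.
    """
    residues = [tok for tok in sequence_text.split() if not tok.isdigit()]
    return [i for i, res in enumerate(residues, start=1) if res == "Xaa"]


def parse_location(location):
    """Parse a <222> response such as '(8)...(13)' into (start, end)."""
    numbers = [int(n) for n in re.findall(r"\d+", location)]
    if not numbers:
        raise ValueError(f"no position found in {location!r}")
    return numbers[0], numbers[-1]


def unexplained_xaa(sequence_text, locations):
    """Return Xaa positions not covered by any of the given <222> ranges."""
    ranges = [parse_location(loc) for loc in locations]
    return [
        pos
        for pos in xaa_positions(sequence_text)
        if not any(start <= pos <= end for start, end in ranges)
    ]
```

For example, unexplained_xaa("Xaa Xaa Xaa Gly Xaa", ["(1)...(3)"]) returns [5], because the Xaa at position 5 lies outside the only range given.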



Seq#:4;  Xaa Pos. 17,19,20,21,
Seq#:47; Xaa Pos. 31,36,37,38
Seq#:48; Xaa Pos. 36,37,38
Seq#:51; Xaa Pos. 31,36,37
Seq#:52; Xaa Pos. 31,36
Seq#:55; Xaa Pos. 6
Seq#:59; Xaa Pos. 10
Seq#:60; Xaa Pos. 22
Seq#:61; Xaa Pos. 23
Seq#:65; Xaa Pos. 31,36,37
Seq#:66; Xaa Pos. 1,2,3,5,6,7
Seq#:66; Xaa Pos. 25,26,27,28
VERIFICATION SUMMARY                     DATE: 07/18/2003
PATENT APPLICATION: US/09/554,533        TIME: 13:08:13

Input Set : A:\EP.txt
Output Set: N:\CRF4\07182003\l554533.raw

L:8 M:283 W: Missing Blank Line separator, <130> field identifier
L:72 M:283 W: Missing Blank Line separator, <220> field identifier
L:83 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:3
L:83 M:252 E: No. of Seq. differs, <211> LENGTH: Input:30 Found:16 SEQ:3
L:163 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:4
L:164 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:0
M:332 Repeated in SeqNo=4
L:165 M:252 E: No. of Seq. differs, <211> LENGTH: Input:29 Found:13 SEQ:4
L:184 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:5
M:332 Repeated in SeqNo=5
L:187 M:301 E: (44) No Sequence Data was Shown, SEQ ID:5
L:187 M:252 E: No. of Seq. differs, <211> LENGTH: Input:30 Found:0 SEQ:5



L:1186 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:47 after pos.:16
M:341 Repeated in SeqNo=47
L:1218 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:48 after pos.:32
L:1302 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:51 after pos.:16
M:341 Repeated in SeqNo=51
L:1331 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:52 after pos.:16
M:341 Repeated in SeqNo=52
L:1405 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:55 after pos.:0
L:1500 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:59 after pos.:0
L:1529 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:60 after pos.:16

L:1546 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:61 
L:1553 M:258 W: Mandatory Feature missing, <222> Tag not found for SEQ ID#:61 
L:1553 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:61 after pos.: 16 
L:1615 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:64 
L:1640 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:65 
L:1647 M:258 W: Mandatory Feature missing, <222> Tag not found for SEQ ID#:65 
L:1647 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:65 after pos.:16 
M:341 Repeated in SeqNo=65 

L:1743 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:66 after pos.:0
M:341 Repeated in SeqNo=66 
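
When a verification summary is long, it can help to tabulate its entries before correcting the listing. The sketch below parses lines of the "L:... M:... E/W: ..." shape shown above; it is an assumption-laden illustration, not a USPTO tool. The names ENTRY, parse_summary and count_by_severity are hypothetical, the E and W flags are presumed to mean error and warning, and "M:... Repeated in SeqNo=..." continuation lines are skipped because they carry no line number.

```python
import re
from collections import Counter

# Matches entries such as:
#   L:83 M:332 E: (32) Invalid/Missing Amino Acid Numbering, SEQ ID:3
ENTRY = re.compile(
    r"^L:\s*(?P<line>\d+)\s+M:\s*(?P<code>\d+)\s+(?P<severity>[EW]):\s*(?P<text>.*)$"
)


def parse_summary(report_text):
    """Return one dict per 'L:... M:... E/W:' entry found in the text."""
    entries = []
    for raw in report_text.splitlines():
        match = ENTRY.match(raw.strip())
        if match:
            entries.append({
                "line": int(match.group("line")),
                "code": int(match.group("code")),
                "severity": match.group("severity"),
                "text": match.group("text").strip(),
            })
    return entries


def count_by_severity(entries):
    """Count parsed entries per severity flag ('E' or 'W')."""
    return Counter(entry["severity"] for entry in entries)
```

Grouping the parsed entries by their "line" value gives a quick list of which listing lines to revisit first.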